Figure 1 

Generate Question Phrases from Questions -\ 
in Training Data 

Generate Candidate Transforms from Answers 

in Training Data \ 

Evaluate Candidate Transforms for each 
Search Engine 

Output Best Transforms for each Search Engine J 



Figure 2 



Question Phrase (s) 

"who was" 
"how do i" 
"where is", 
"where can i" 
"what are" 
"what is" 
"what is a" 



Figure 3 



A what (is | are | were | does | do | did | should | can) \s 

A who (is I are | was | were | did | do | does) \s 

A how (to ] is | do | did | does | can | would | could | should | ) \s 

A why (is | do | are | did | were | does) \s 

A where (is | was | can | are | were | do | does) \s 

A when (is | was | are | were | do | did | does) \s 

A which \s 
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Figure 4 



Question Phrase 


Candidate Transforms 




"the term" 




"component" 




"ans" 


"what is a" 


"a computer" 




"telephone" 




"collection of 




"stands for" 




"unit" 
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Figure 5 



Question Phrase 


Candidate Transform 
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"refers to" 
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"refers" 
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"meets" 
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"what is a" 


"driven" 
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"named after" 
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/to describe" 
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Figure 6 



Transform Length 


Candidate Transform tr, 


WtTj 


3 


"is used to" 
"according to the" 
"to use a" | 




32.89 
23.49 
21.43 
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"is a" 
"of a" 
"refers to" 




298.89 
94.34 
81.3 
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"usually" 

"used" 

"refers" 
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Figure 7 

71 

(1) Examples = RetrieveExamples (QP, numExamples) 

for each <Question, Answer> in Examples 

for each candidate transform trt 72 

(2) Query = ApplyTransform (Question, tr,) 

(3) Results = SubmitQuery (Query, SE) 

for each Document in Results 73 

docScore = -1 74a 
(4a) Subdocuments = getSubDocuments (Document, 

subDocLen) 

for each SubDocument in SubDocuments 
(4b) tmpScore = DocumentSimilarity (Answer, SubDocument) 

if (tmpScore > docScore) docScore = tmpScore \ 

(4c) updateTransformScores (tr h docScore)^-" 740 \ 

updateTransformCounts (tri) 7 ^ 

^75 

(5) AssignTransform Weights (TransformScores, TransformCounts) 
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"is usually" 


377.03 




"is usually" 


280.68 




"refers to" 


373.22 




"usually" 


275.68 


AltaVista 


"usually" 


371.55 


Google 


"called" 


256.64 




"refers" 


370.14 




"sometimes" 


253.53 




"is used" 


360.07 




"is one" 


253.24 



Figure 9 



(la) QP = matchQuestionPhrase (Question)^" 91b 
(lb) {tr} = retrieveTransforms (QP, numTransforms) 

for each tr t m{tr) 92 

(2) Query = ApplyTransform (Question, tr^ 

(3) Results = SubmitQuery (Query, S£)~^^ 93 
for each Document in Results 

maxScore = Maximum Score for this Document so far 94a 
( 4a ) Subdocuments = getSubDocument (Document, subDocLen^^ 

for each SubDocument in SubDocuments 94b 
( 4 ^>) Score = DocumentSimilarity (Query, SubDocument) 

( 4c ) if (Score > maxScore) Update maximum Score of Document to Score 

(5) RankedDocuments = Sort Document in decreasing order of document Score ^ 

(6) Return topKdocuments with highest Score^ \^ 

96 95 



